CDS
Accession Number | TCMCG075C05622 |
gbkey | CDS |
Protein Id | XP_007042474.2 |
Location | complement(join(5217584..5218087,5218519..5218579,5218668..5221089,5221239..5221380,5221519..5221617)) |
Gene | LOC18607964 |
GeneID | 18607964 |
Organism | Theobroma cacao |
Protein
Length | 1075aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_007042412.2 |
Definition | PREDICTED: filament-like plant protein 7 [Theobroma cacao] |
EGGNOG-MAPPER Annotation
COG_category | S |
Description | Filament-like plant protein |
KEGG_TC | - |
KEGG_Module | - |
KEGG_Reaction | - |
KEGG_rclass | - |
BRITE |
ko00000
[VIEW IN KEGG] ko00001 [VIEW IN KEGG] ko04147 [VIEW IN KEGG] ko04812 [VIEW IN KEGG] |
KEGG_ko |
ko:K10352
[VIEW IN KEGG] |
EC | - |
KEGG_Pathway |
ko04530
[VIEW IN KEGG] map04530 [VIEW IN KEGG] |
GOs | - |
Sequence
CDS: ATGGACCACAAAACGTGGCTTTGGCGGAAAAAATCTTCTGAGAAGACAATTGTTGCTACTGACAAGGTTGACATGTCTTTGAAACGAATTGATGAAGAGGTACAAATGCCTCCGATGGAGGGGCCACGAGATAGAATAGTGAAAAATCTAAATGAGAAGCTTGCTTCAGTCCTCCTTGATTGTCATGCTAAAGAGGATCTGGTGACAAAAAATGTGAAAATGGCACCAGAAGCAAATGCAGGTTGGGAAAAGGCAGAAGCAGATGCAATCTTTCTGAAGAAAGAGCTAGAAGAAGCTTTGAGGCAGGGAAAATTGGCAAATGAAAAGTTAACTCGCTCAGATGCTGCGTTGAAGGAATGTATGCAACAGCTAAATTTTTTTAGAGAAGAGCAGGAGCAAAGGATGCGTGATGCTATCATGAAGACATCAAGTGAGTTTGAGAAAGCACAGGAATCATTACAAGACAAGCTGACAGAGACAAACAGAAGGCTTGAAGAGTTGGTGGTTGAGAATTCTCGACTGAGTAAGGCCCTGCTAGTCAAAGAAAAATTGATTGAAGATCAGCAGAAGCACAAGTCTCAGGCAGAAGCAGAATTTGGTGCACTAATGGCTAGATTAGATTTCACTGAAAAGGAAAATACATTTTTGAAATATGAGTTTCATGTCCTCGAGAAGGAGCTTGAGATCCGAAATGAAGAGATGGAATACAACCGTCGATCAGCTGACTTAGCACATAAGCAACATTTGGATGGTGTAAAGAAAATCGCAAAGTTGGAAGCAGAATGCCAGAAGTTACGTCTCCTTCTGCAAAAGAGGCTGCCAGGTCCTGCTGCTGTGATGAAAATGAAGAATGAAGTTGAAATGCTGGGGAGGGACAAGACAGAGCTGAGAAGAAGAAAGTTGAATTCCACTAGAGATTTGATTATCAGAGACTCTGCCACTGAAAATTCTCCTGATAATCCAACTAAGAACATCAATCTCCTGTTAGAGCAATTACGCAATGTGGAAGAAGAAAACAGGACTCTCAAAGAAATCATGACCAAGAAAAATGCTCAACTCCAGTCATCAAGTTTAGCATGCTCTCAAACATTATCCAGGCCAACACAGGTTGAGATTCAGCCTAAAAAGCTGTTTACAGGACAGAACTCCATGGAGCTGGTAAGGAGTAGTCCAATATCTAGTGAACTATCTCAAACATCAGGTTTTGATATTGGCAGTATTGATGGAATTAGCTCTTCTTGTTCATGGGCTAATGCTTTGATTTCAGAACCTGCACATTCTAGAGACAGAAAACTTAGGAATCCAATGGAACACAAGGCGATTACAGTTCCAGAAATGAGATTAATGGACGATTTTGTTGAGATGGAAAAATTAGCCTTAGTTTCTGGAGGTGGATATAATCCAGTATCAGATGGTGAGGGATTGCTTCCATTTGGGCAGGGTCACTGTGGTTTCAGCAACACAAAACAGATTCACTCAAGAGATGTAGCAGCTGAAAGATCTTTTGATTGGCTTCAGGTTGTTTTGCATGCTATCTCCGAGCACAAACGTATTTCCAACCGAAGTTTAGATGAAATTCTCGAGGACATCAAAATTGCTTTGGGTTGTAGCACTCTTTTAACTGATGGTGATGTTAGTAAAACAGCATGCTCAATGCATCCAATAGAATCTGATGCTCTCCATATCAGTGGCTACATAGGCTGGAAATCTCCAAACACATCTCCTTCTGTAGGTTCACTCAGTGGAGCTTCTACTGTTGAGAACTCAGCAGAAAAAACAAAGAAGCAACAGTTTCAATCTAATCTGAGCAAGTCAATCTCTAAAATTGTTGAGCTGATTGAAGGAATTGATCTAACATCTTACAATACCTCTAGCAGTTGTCTAGAGAGAGATCAAAGTCCCAAACAAGCAGTAGCTCATGCAGATTACTTTGTCCGTGTTTTTCAATGGAAAAGTTCTGAATTAAGCACTGTTCTACAACAATTTCTTCGCACTTGCAATGATCTGTTGAATAAAAGGGCTGATCTTGAAAATTTTGCTGGAGAACTCTCATTTGCTTTGGACTGGATGTTGAACAACTGTGTTACTCCTAAGGAAGCCTCAAGTGCAAGGGATAAGATCAAAAGGCATTTTGGTTGGATTGAATCACAAAATGATAAAGACGTTGGTTCTGAAGGGAATGTTTTAGTATTGGAACCAGATGTGATCCATATTTCTGAAGAACAATCATCATGCTTAGGCTCCTTTGCTTCTTCACATGACCAGAATCTGAATGTTATCTCCGAAAAGGAGGGTATTCAGTGTAGCTTGGAAGAAGAAAACAAGAGATTAAAAGATGATTTGAAGAATATGGAAGCCAGGCTGGAGTCAGCAACTGACAAGAGTGAGGCCTTGACAGTACAACTTCATGAATCAGAACAAAGCATTGGAAGCTTACAAACAGAACTCAAGATATCAAAAGAAACAAAGGAAATGATTGAGGATCAGGTTGAAAATCAGAAATCAATTAATGAAGATCTTGATACCCAGCTTACAGTTGCAAAAGCTAAACTGAATGAAATCTTCCAGAAGTGCTCATCTTTGGAAGTTGAGTTGGAGTACAAAAATAACTGTTGTGAAGAGCTAGAAGCTACATGTCTTGAGCTTCAACTCCAGTTAGAGAGTGTGGCAAGGAAAGAAACACCAAAGTATGTCATGAATCGAGAGGGGAAGCAATCTCAAAACGGTTGGGAAATTACAGCAGCTTCAGTGAAGTTGGCAGAGTGCCAAGAAACAATTCTGAACCTAGGTAAGCAATTAAAGGTGTTGGCTTCACCACAAGATGCAGCACTCTTTGACAAGGTTTTCTCCAGCAGTGGTGCCGCCACCACTGTAATAAATAACAGAAGGGTGAACAGACGCTTCTCCTTGCGTGATCGAATGCTAGCTGAGGATGGTTCTAAAGCAGAGGTTCACAAGTCTCCCAATATTAGAGGAACTTTAAGCATTGGAGAAGCAGAGAATTCATCCCTTCCTGACTCCAATAATTGTAAGAACTTGCAAGCTTCTGGTTTGGTGGTAAATACTTCAGAAGCACATCTTGGTTCCAAGAAGGAAGGTACTAACACTGCAGTTATGGCTTTGGCTATTGTGCCAAGCAAGAAGCAAGGAGTTGGTTTGCTAAGGAGGCTGCTGTTAAGAAGGAAGAAAGGTTACAGTAAGAAATCTCATTACCAAAAGACTGACTAG |
Protein: MDHKTWLWRKKSSEKTIVATDKVDMSLKRIDEEVQMPPMEGPRDRIVKNLNEKLASVLLDCHAKEDLVTKNVKMAPEANAGWEKAEADAIFLKKELEEALRQGKLANEKLTRSDAALKECMQQLNFFREEQEQRMRDAIMKTSSEFEKAQESLQDKLTETNRRLEELVVENSRLSKALLVKEKLIEDQQKHKSQAEAEFGALMARLDFTEKENTFLKYEFHVLEKELEIRNEEMEYNRRSADLAHKQHLDGVKKIAKLEAECQKLRLLLQKRLPGPAAVMKMKNEVEMLGRDKTELRRRKLNSTRDLIIRDSATENSPDNPTKNINLLLEQLRNVEEENRTLKEIMTKKNAQLQSSSLACSQTLSRPTQVEIQPKKLFTGQNSMELVRSSPISSELSQTSGFDIGSIDGISSSCSWANALISEPAHSRDRKLRNPMEHKAITVPEMRLMDDFVEMEKLALVSGGGYNPVSDGEGLLPFGQGHCGFSNTKQIHSRDVAAERSFDWLQVVLHAISEHKRISNRSLDEILEDIKIALGCSTLLTDGDVSKTACSMHPIESDALHISGYIGWKSPNTSPSVGSLSGASTVENSAEKTKKQQFQSNLSKSISKIVELIEGIDLTSYNTSSSCLERDQSPKQAVAHADYFVRVFQWKSSELSTVLQQFLRTCNDLLNKRADLENFAGELSFALDWMLNNCVTPKEASSARDKIKRHFGWIESQNDKDVGSEGNVLVLEPDVIHISEEQSSCLGSFASSHDQNLNVISEKEGIQCSLEEENKRLKDDLKNMEARLESATDKSEALTVQLHESEQSIGSLQTELKISKETKEMIEDQVENQKSINEDLDTQLTVAKAKLNEIFQKCSSLEVELEYKNNCCEELEATCLELQLQLESVARKETPKYVMNREGKQSQNGWEITAASVKLAECQETILNLGKQLKVLASPQDAALFDKVFSSSGAATTVINNRRVNRRFSLRDRMLAEDGSKAEVHKSPNIRGTLSIGEAENSSLPDSNNCKNLQASGLVVNTSEAHLGSKKEGTNTAVMALAIVPSKKQGVGLLRRLLLRRKKGYSKKSHYQKTD |